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Lactobacilli Expressing Biologically Active Polypeptides and Uses Thereof 

CROSS REFERENCE TO RELATED APPLICATIONS 

[01] The present application claims priority to U.S. Provisional Patent 

Application No. 60/362,945, filed on March 8, 2002, which is incorporated by reference for 

all purposes. 

STATEMENT AS TO RIGHTS TO INVENTIONS MADE UNDER 
FEDERALLY SPONSORED RESEARCH AND DEVELOPMENT 
[02] This invention was made with Government support under SBIR phase 
I Grant No. 1 R43 AI46403-01 . The government has certain rights in this invention. 

BACKGROUND OF THE INVENTION 
[03] The mucosal membranes of all humans are naturally colonized by 
bacteria. Recent scientific evidence has documented the fact that these bacteria interact 
closely with cells and tissues of the body to regulate natural biological processes. It has 
become increasingly evident that this mucosal microflora also contributes substantially to 
numerous diseases affecting cells and tissues of humans. 

[04] Generally, domination of the microflora within the vagina and 
gastrointestinal tract, by lactobacilli and related bacteria, is associated with good health. 
Natural strains of lactobacilli have been administered for many years as "probiotics" for the 
purpose of maintaining a healthy microflora within these locations and preventing infection. 
It is well established that these "healthy bacteria" compete with pathogenic organisms, such 
as bacteria, viruses and fungi to limit the development and progression of pathogen 
associated diseases. Nevertheless, this microflora is a fragile and dynamic environment with 
the natural turnover and disruption of the healthy microflora being associated with the 
establishment of opportunistic infections. Consequently, approaches to maintain, or even 
enhance, the integrity and natural properties of the microflora, as a means of preventing or 
treating disease, would be coveted by the biomedical community. 

[05] Currently, the predominant mode of transmission of viruses, like 
human immunodeficiency virus (HIV) and herpes simplex virus type 2 (HSV-2), is via 
heterosexual contact. Women are particularly at risk for infection by sexually transmitted 
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viruses, as the efficiency of HTV and HSV-2 transmission from male to.female.is greater than 
for the reverse (al-Nozha et al., J- Acqitir. Immune. Defic. Syndv. 3:193-194 (1990); Wald et 
al, JAMA 285:3100-3 106 (2001)). The first natural line of defense against disease 
transmission for women during heterosexual contact is the mucosal barrier in the vagina. In 
healthy American and European women of childbearing age, either Lactobacillus crispatus or 
L.jensenii are most commonly the predominant bacteria colonizing the vaginal mucosa 
(Antonio et aU J Infect. Dis. 180:1950-1956 (1999); Vasquez et al, J- Clin. Microbiol. 
40:2746-2749 (2002)). These lactobacilli secrete lactic acid as a byproduct of metabolism, 
serving to acidify the vaginal mucosa and to keep the numbers of competing pathogenic 
microorganisms relatively low. Additionally, the predominance of lactobacilli in the vaginal 
microflora of healthy women has been associated with a lowered incidence of HIV 
acquisition, perhaps correlating with the observed antagonistic effects of metabolic 
byproducts of lactobacilli (lactic acid and peroxide), on the infectivity of the HIV virus in 
vitro. Therefore, the association of beneficial lactobacilli with the vaginal mucosa can be 
considered together as a protective "biofilm". Sexually transmitted viruses like HIV and 
HSV-2 must first successfully traverse this protective matrix before infecting a host and 
causing disease. 

[06] The mucosal microflora contributes to many other local diseases 
affecting mucosal surfaces. For instance, the etiology of inflammatory bowel diseases, 
including ulcerative colitis and Crohn's disease may arise from inappropriate interactions 
between a disrupted mucosal microflora and cells and tissues of the host. A means of 
modulating the properties of bacteria within the mucosal flora could aid in the prevention and 
or treatment of these diseases, as well as related conditions affecting mucosal surfaces. 

[07] The present invention addresses these and other problems. 

BRIEF SUMMARY OF THE INVENTION 
[08] The present invention provides mucosal surface-colonizing 

Lactobacillus jensenii bacteria recombinant^ altered to express a biologically active protein. 

In some embodiments, the mucosal surface resides within the vagina. In some embodiments, 

the mucosal surface resides within the gastrointestinal tract. In some embodiments, the 

mucosal surface is a human mucosal surface. 

[09] In some embodiments, the protein requires a disulfide bond to be 

biologically active. In some embodiments, the biologically active protein is secreted. 
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[10| In some embodiments, the biologically active protein is on the 
bacterial surface. In some embodiments, the protein comprises a signal sequence selected 
from the group consisting of SEQ.ID NO:l, SEQ ID NO:2 and SEQ ID NO:3. hi some 
embodiments, the biologically active protein comprises a cell wall anchoring signal sequence. 
In some embodiments, the cell wall anchoring signal sequence comprises SEQ ID NO:4 or 
SEQIDNO:5. 

[11] In some embodiments, the biologically active protein binds to a 
pathogen when the biologically active protein is contacted with a pathogen. In some 
embodiments, the pathogen is a bacterial pathogen. In some embodiments, the pathogen is a 
fungal pathogen. In some embodiments, the pathogen is a viral pathogen. 

[12] In some embodiments, the viral pathogen is HIV. In some 
embodiments, the biologically active protein is CD4 or an HlV-binding fragment of CD4. In 
some embodiments, the biologically active protein is two-domain CD4 (2D-CD4). In some 
embodiments, the biologically active protein is cyanovirin-N or a virus-binding fragment of 
cyanovirin-N. In some embodiments, the viral pathogen is HSV. In some embodiments, the 
biologically active protein binds to HSV. In some embodiments, the protein is herpes virus 
entry mediator C (HveC) or a virus-binding fragment of HveC. 

[13] In some embodiments, the biologically active protein is a therapeutic 
polypeptide. In some embodiments, the biologically active protein is a polypeptide vaccine. 

[14] In some embodiments, the biologically active protein has a molecular 
weight of between 500 and 40,000 daltons. 

[15] The invention also provides methods of providing a biologically active 
protein to a mammalian mucosal surface. In some embodiments, the methods comprise, 
contacting the mucosal surface with Lactobacillus jensenii bacteria recombinantly altered to 
express a biologically active protein in an amount able to be detected in a sample collected 
from the mucosal surface. In some embodiments, the mucosal surface resides within the 
vagina. In some embodiments, the mucosal surface resides within the gastrointestinal tract. 

[16] In some embodiments, the contacting step comprises orally 
administering the Lactobacillus jensenii bacteria. In some embodiments, the contacting step 
comprises rectally administering the Lactobacillus jensenii bacteria. In some embodiments, 
the contacting step comprises vaginally administering the Lactobacillus jensenii bacteria. 

[17] The present invention also provides expression cassettes comprising a 
polynucleotide encoding a heterologous bacterial signal sequence substantially identical to 
SEQ ID NO:2. In some embodiments, the expression cassette comprises a polynucleotide 
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encoding a polypeptide, wherein the amino-tenninus of the polypeptide comprises a 
heterologous bacterial signal sequence at least 95% identical to SEQ ID NO:2. In some 
embodiments, the bacterial signal sequence comprises SEQ ID NO:2. 

[IS] The present invention also provides vectors comprising the expression 
cassette and host cells comprising such vectors. In some embodiments, the cell is a gram- 
positive bacterium. 

DEFINITIONS 

[19] A "vaginal colonizing bacteria" refers to bacteria that naturally 
colonize the vagina or recombinantly altered strains of such bacteria. Examples of vaginal 
colonizing bacteria include, e.g., L. jeiisenii and L. crispatus. 

[20] A "biologically active protein" refers to an amino acid sequence that 
has the biological activity (i.e., can participate in the molecular mechanisms) of the amino 
acid sequence within, or outside of, a native cell. Activity of a protein includes, e.g., its 
immunogenicity. Therefore, polypeptide vaccines are biologically active proteins. Typically, 
the amino acid sequence forms the three-dimensional structure formed by the amino acid 
sequence within or outside of the native cell. 

[21] A polypeptide "associated with a bacterial surface" refers to a 
polypeptide attached or imbedded in a bacterial membrane or cell wall matrix or biofilm. 

[22] A "pathogen" refers to a virus, bacteria, fungus, protozoa or other 
microscopic organism that causes disease of harm to a host. 

[23] "2D-CD4" refers to the first approximately 1 83 amino acids of 2- 
domain CD4. CD4 is a cell-surface glycoprotein found on the mature helper T cells and 
immature thymocytes, as well as monocytes and macrophages. 2D-CD4 binds to HIV-1 
gpl20 with the same affinity as the intact protein (Salzwedel et aL 9 J. Virol. 74:326-333 
(2000)), and contains the binding site for gpl20. CD4 contains an amino-terminal 
extracellular domain (amino acid residues 1 to 371), a transmembrane region (372 to 395) 
and a cytoplasmic tail (396 to 433). 

[24] A bacterial "biofilm" refers to a complex network of different bacteria 
and extracellular matrix materials secreted by the bacteria. 

[25] The term "gene" means the segment of DNA involved in producing a 
polypeptide chain; it includes regions preceding and following the coding region (leader and 
trailer). 
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[26] A "therapeutic protein" or "therapeutic polypeptide" refers to a 
polypeptide possessing biological activity that can be used for the prevention and/or 
treatment of disease. Examples of therapeutic polypeptides include those capable of 
preventing, inhibiting, stabilizing or reversing an inherited or noninherited genetic defect in 
metabolism, immune regulation, hormonal regulation, enzymatic or membrane associated 
structural function. For example, therapeutic protein can replace an absent or defective 
cellular protein or enzyme, or supplement production of a defective or low expressed cellular 
protein or enzyme. 

[27] "Antibody" refers to a polypeptide substantially encoded by an 
immunoglobulin gene or immunoglobulin genes, or fragments thereof which specifically bind 
and recognize an analyte (antigen). The recognized immunoglobulin genes include the 
kappa, lambda, alpha, gamma, delta, epsilon and mu constant region genes, as well as the 
myriad immunoglobulin variable region genes. Light chains are classified as either kappa or 
lambda. Heavy chains are classified as gamma, mu, alpha, delta, or epsilon, which in turn 
define the immunoglobulin classes, IgG, IgM, IgA, IgD and IgE, respectively. 

[28] An exemplary immunoglobulin (antibody) structural unit comprises a 
tetramer. Each tetramer is composed of two identical pairs of polypeptide chains, each pair 
having one "light" (about 25 kDa) and one "heavy" chain (about 50-70 kDa). The N- 
terminus of each chain defines a variable region of about 100 to 1 10 or more amino acids 
primarily responsible for antigen recognition. The terms variable light chain (V L ) and 
variable heavy chain (V H ) refer to these light and heavy chains respectively. 

[29] Antibodies exist, e.g., as intact immunoglobulins or as a number of 
well characterized fragments produced by digestion with various peptidases. Thus, for 
example, pepsin digests an antibody below the disulfide linkages in the hinge region to 
produce F(ab')2 3 a dimer of Fab which itself is a light chain joined to Vh-C h 1 by a disulfide 
bond. The F(ab') 2 may be reduced under mild conditions to break the disulfide linkage in 
the hinge region, thereby converting the F(ab')2 dimer into an Fab 1 monomer. The Fab 1 
monomer is essentially an Fab with part of the hinge region (see, Paul (Ed.) Fundamental 
Immunology, Third Edition, Raven Press, NY (1993)). While various antibody fragments are 
defined in terms of the digestion of an intact antibody, one of skill will appreciate that such 
fragments may be synthesized de novo either chemically or by utilizing recombinant DNA 
methodology. Thus, the term antibody, as used herein, also includes antibody fragments 
either produced by the modification of whole antibodies or those synthesized de novo using 
recombinant DNA methodologies (e.g., single chain Fv). 
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[30] The term "isolated," when applied to a nucleic acid or protein, denotes 
that the nucleic acid or protein is essentially free of other cellular components with which it is 
associated in the natural state. It is preferably in a homogeneous state although it can be in 
either a dry or aqueous solution. Purity and homogeneity are typically determined using 
analytical chemistry techniques such as polyacrylamide gel electrophoresis or high 
performance liquid chromatography. A protein that is the predominant species present in a 
preparation is substantially purified. In particular, an isolated gene is separated from open 
reading frames that flank the gene and encode a protein other than the gene of interest. The 
term "purified" denotes that a nucleic acid or protein gives rise to essentially one band in an 
electrophoretic gel. Particularly, it means that the nucleic acid or protein is at least 85% pure, 
more preferably at least 95% pure, and most preferably at least 99% pure. 

[31] The term "nucleic acid" or "polynucleotide" refers to 
deoxyribonucleotides or ribonucleotides and polymers thereof in either single- or double- 
stranded form. Unless specifically limited, the term encompasses nucleic acids containing 
known analogues of natural nucleotides that have similar binding properties as the reference 
nucleic acid and are metabolized in a manner similar to naturally occurring nucleotides. 
Unless otherwise indicated, a particular nucleic acid sequence also implicitly encompasses 
conservatively modified variants thereof (e.g., degenerate codon substitutions) and 
complementary sequences as well as the sequence explicitly indicated. Specifically, 
degenerate codon substitutions may be achieved by generating sequences in which the third 
position of one or more selected (or all) codons is substituted with mixed-base and/or 
deoxyinosine residues (Batzer et al, Nucleic Acid Res. 19:5081 (1991); Ohtsuka et al. 9 J. 
Biol Chem. 260:2605-2608 (1985); andRossolini etal.,Mol Cell Probes 8:91-98 (1994)). 
The term nucleic acid is used interchangeably with gene, cDNA, and mRNA encoded by a 
gene. 

[32] The terms "polypeptide," "peptide" and "protein" are used 
interchangeably herein to refer to a polymer of amino acid residues. The terms apply to 
amino acid polymers in which one or more amino acid residue is an artificial chemical 
mimetic of a corresponding naturally occurring amino acid, as well as to naturally occurring 
amino acid polymers and non-naturally occurring amino acid polymers. As used herein, the 
terms encompass amino acid chains of any length, including full-length proteins (Le. y 
antigens), wherein the amino acid residues are linked by covalent peptide bonds. 

[33] The term "amino acid" refers to naturally occurring and synthetic 
amino acids, as well as amino acid analogs and amino acid mimetics that function in a 
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manner similar to the naturally occurring amino acids. Naturally occurring amino acids .are 
those encoded by the genetic code, as well as those amino acids that axe later modified, e.g., 
hydroxyproline, 7 -carboxyglutamate, and O-phosphoserine. Amino acid analogs refers to 
compounds that have the same basic chemical structure as a naturally occurring amino acid, 
i.e., an a carbon that is bound to a hydrogen, a carboxyl group, an amino group, and an R 
group, e.g., homoserine, norleucine, methionine sulfoxide, methionine methyl sulfonium. 
Such analogs have modified R groups {e.g., norleucine) or modified peptide backbones, but 
retain the same basic chemical structure as a naturally occurring amino acid. "Amino acid 
mimetics" refers to chemical compounds that have a structure that is different from the 
general chemical structure of an amino acid, but which functions in a manner similar to a 
naturally occurring amino acid. 

[34] Amino acids may be referred to herein by either the commonly known 
three letter symbols or by the one-letter symbols recommended by the IUPAC-IUB 
Biochemical Nomenclature Commission. Nucleotides, likewise, may be referred to by their 
commonly accepted single-letter codes. 

Two nucleic acid sequences or polypeptides are said to be "identical" if the 
sequence of nucleotides or amino acid residues, respectively, in the two sequences is the 
same when aligned for maximum correspondence as described below. The terms "identical" 
or percent "identity," in the context of two or more nucleic acids or polypeptide sequences, 
refer to two or more sequences or subsequences that are the same or have a specified 
percentage of amino acid residues or nucleotides that are the same, when compared and 
aligned for maximum correspondence over a comparison window, as measured using one of 
the following sequence comparison algorithms or by manual alignment and visual inspection. 
When percentage of sequence identity is used in reference to proteins or peptides, it is 
recognized that residue positions that are not identical often differ by conservative amino acid 
substitutions, where amino acids residues are substituted for other amino acid residues with 
similar chemical properties (e.g., charge or hydrophobicity) and therefore do not change the 
functional properties of the molecule. Where sequences differ in conservative substitutions, 
the percent sequence identity may be adjusted upwards to correct for the conservative nature 
of the substitution. Means for making this adjustment are well known to those of skill in the 
art. Typically this involves scoring a conservative substitution as a partial rather than a full 
mismatch, thereby increasing the percentage sequence identity. Thus, for example, where an 
identical amino acid is given a score of 1 and a non-conservative substitution is given a score 
of zero, a conservative substitution is given a score between zero and 1. The scoring of 
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conservative substitutions is calculated according to, e.g., the algorithm of Meyers & Miller, 
Computer Applic. Biol Set 4:11-17 (1938) e.g., as implemented in the program PC/GENE 
(Intelligenetics, Mountain View, California, USA). 

The phrase "substantially identical," in the context of two nucleic acids or 
polypeptides, refers to a sequence or subsequence that has at least 70% sequence identity 
with a reference sequence. Alternatively, percent identity can be any integer from 40% to 
100%. More preferred embodiments include at least: 40%, 45%, 50%, 55%, 60%, 65%, 
70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98 or 99% compared to 
a reference sequence (e.g. , SEQ ID NO:2) using the programs described herein, such as 
BLAST using standard parameters, as described below. This definition also refers to the 
complement of a test sequence, when the test sequence has substantial identity to a reference 
sequence. 

For sequence comparison, typically one sequence acts as a reference sequence, 
to which test sequences are compared. When using a sequence comparison algorithm, test 
and reference sequences are entered into a computer, subsequence coordinates are designated, 
if necessary, and sequence algorithm program parameters are designated. Default program 
parameters can be used, or alternative parameters can be designated. The sequence 
comparison algorithm then calculates the percent sequence identities for the test sequences 
relative to the reference sequence, based on the program parameters. 

A "comparison window", as used herein, includes reference to a segment of 
any one of the number of contiguous positions selected from the group consisting of from 20 
to 600, usually about 50 to about 200, more usually about 100 to about 150 in which a 
sequence may be compared to a reference sequence of the same number of contiguous 
positions after the two sequences are optimally aligned. Methods of alignment of sequences 
for comparison are well-known in the art. Optimal alignment of sequences for comparison 
can be conducted, e.g., by the local homology algorithm of Smith & Waterman, Adv. Appl. 
Math. 2:482 (1981), by the homology alignment algorithm of Needleman & Wunsch, Mol. 
Biol. 48:443 (1970), by the search for similarity method of Pearson & Lipman, Proa Natl 
Acad. Sci. USA 85:2444 (1988), by computerized implementations of these algorithms (GAP, 
BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Accelrys, 
9685 Scranton Road, San Diego, CA), or by manual alignment and visual inspection. 

An example of an algorithm that is suitable for determining percent sequence 
identity and sequence similarity is the BLAST algorithm, which is described in Altschul et 
al., J. Mol. Biol. 215:403-410 (1990). Software for performing BLAST analyses is publicly 
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available through the National Center for Biotechnology Information. This algorithm 
involves first identifying high scoring sequence pairs (HSPs) by identifying short words of 
length W in the query sequence, which either match or satisfy some positive-valued threshold 
score T when aligned with a word of the same length in a database sequence. T is referred to 
as the neighborhood word score threshold (Altschul et al 9 supra). These initial neighborhood 
word hits act as seeds for initiating searches to find longer HSPs containing them. The word 
hits are extended in both directions along each sequence for as far as the cumulative 
alignment score can be increased. Extension of the word hits in each direction are halted 
when: the cumulative alignment score falls off by the quantity X from its maximum achieved 
value; the cumulative score goes to zero or below, due to the accumulation of one or more 
negative-scoring residue alignments; or the end of either sequence is reached. The BLAST 
algorithm parameters W, T, and X determine the sensitivity and speed of the alignment. The 
BLAST program uses as defaults a wordlength (W) of 1 1, the BLOSUM62 scoring matrix 
{see Henikoff & Henikoff, Proc. Natl. Acad. Sci: USA 89:10915 (1989)) alignments (B) of 
50, expectation (E) of 10, M=5, N— 4, and a comparison of both strands. 

The BLAST algorithm also performs a statistical analysis of the similarity 
between two sequences (see, e.g., Karlin & Altschul, Proc. Natl Acad. Sci. USA 90:5873- 
5787 (1993)). One measure of similarity provided by the BLAST algorithm is the smallest 
sum probability (P(N)), which provides an indication of the probability by which a match 
between two nucleotide or amino acid sequences would occur by chance. For example, a 
nucleic acid is considered similar to a reference sequence if the smallest sum probability in a 
comparison of the test nucleic acid to the reference nucleic acid is less than about 0.2, more 
preferably less than about 0.01, and most preferably less than about 0.001. 

[35] "Conservatively modified variants" applies to both amino acid and 
nucleic acid sequences. With respect to particular nucleic acid sequences, "conservatively 
modified variants" refers to those nucleic acids that encode identical or essentially identical 
amino acid sequences, or where the nucleic acid does not encode an amino acid sequence, to 
essentially identical sequences. Because of the degeneracy of the genetic code, a large 
number of functionally identical nucleic acids encode any given protein. For instance, the 
codons GCA, GCC, GCG and GCU all encode the amino acid alanine. Thus, at every 
position where an alanine is specified by a codon, the codon can be altered to any of the 
corresponding codons described without altering the encoded polypeptide. Such nucleic acid 
variations are "silent variations," which are one species of conservatively modified 
variations. Every nucleic acid sequence herein that encodes a polypeptide also describes 
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every possible silent variation of the nucleic acid. One of skill will recognize that each codon 
in a nucleic acid (except AUG, which is ordinarily the only codon for methionine, and TGG, 
which is ordinarily the only codon for tryptophan) can be modified to yield a functionally 
identical molecule. Accordingly, each silent variation of a nucleic acid that encodes a 
polypeptide is implicit in each described sequence. 

[36] As to amino acid sequences, one of skill will recognize that individual 
substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein 
sequence which alters, adds or deletes a single amino acid or a small percentage of amino 
acids in the encoded sequence is a "conservatively modified variant" where the alteration 
results in the substitution of an amino acid with a chemically similar amino acid. 
Conservative substitution tables providing functionally similar amino acids are well known in 
the art. Such conservatively modified variants are in addition to and do not exclude 
polymorphic variants, interspecies homologs, and alleles of the invention. Conservatively 
modified variants of the polypeptides described herein are encompassed by the invention. 

[37] The following eight groups each contain amino acids that are 
conservative substitutions for one another: 



1) 


Alanine (A), Glycine (G); 


2) 


Aspartic acid (D), Glutamic acid (E); 


3) 


Asparagine (N), Glutamine (Q); 


4) 


Arginine (R), Lysine (K); 


5) 


Isoleucine (I), Leucine (L), Methionine (M), Valine (V); 


6) 


Phenylalanine (F), Tyrosine (Y), Tryptophan (W); 


7) 


Serine (S), Threonine (T); and 


8) 


Cysteine (C), Methionine (M) 



{see, e.g., Creighton, Proteins (1984)). 

[38] The phrase "a nucleic acid sequence encoding" refers to a nucleic acid 
which contains sequence information for a structural RNA such as rRNA, a tRNA, or the 
primary amino acid sequence of a specific protein or peptide, or a binding site for a trans- 
acting regulatory agent. This phrase specifically encompasses degenerate codons {i.e., 
different codons which encode a single amino acid) of the native sequence or sequences that 
may be introduced to conform to codon preference in a specific host cell. 

[39] The term "recombinant" or "recombinantly altered" when used with 
reference, e.g., to a cell, or nucleic acid, protein, or vector, indicates that the cell, nucleic 
acid, protein or vector, has been modified by the introduction of a heterologous nucleic acid 
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or protein or the alteration of a native nucleic acid or protein, or that the cell is derived from a 
cell so modified. Thus, for example, recombinant cells express genes that are not found 
within the native (nonrecombinant) form of the cell or express native genes that are otherwise 
abnormally expressed, under-expressed or not expressed at all. 

[40] The term "heterologous" when used with reference to portions of a 
nucleic acid indicates that the nucleic acid comprises two or more subsequences that are not 
found in the same relationship to each other in nature. For instance, the nucleic acid is 
typically recombinantly produced, having two or more sequences from unrelated genes 
arranged to make a new functional nucleic acid, e.g., a promoter from one source and a 
coding region from another source. Similarly, a heterologous protein indicates that the 
protein comprises two or more subsequences that are not found in the same relationship to 
each other in nature (e.g., a fusion protein). 

[41] An "expression cassette" is a nucleic acid, generated recombinantly or 
synthetically, with a series of specified nucleic acid elements that permit transcription of a 
particular nucleic acid in a host cell. The expression cassette can be part of a plasmid, virus, 
or nucleic acid fragment. Typically, the expression vector includes a nucleic acid to be 
transcribed operably linked to a promoter. 

BRIEF DESCRIPTION OF THE DRAWINGS 
[42] Figure 1 illustrates the diagram of pOSEL144 a shuttle vector between 
E. coli and Lactobacillus sp (A). The rep A and erm genes are derived from L. reuteri and the 
ColEl replicon is from pBluescript. A multiple cloning site for insertion of target gene is 
located between erm and ColEl ori. The expression cassettes for either protein secretion (B) 
or anchoring (Q contain unique restriction sites. 

[43] Figure 2 illustrates results from Western analysis of release of 2D-CD4 
by L. jensenii Xna harboring the 2D-CD4 expression plasmid pOSEL65 1 (Xna-65 1) relative 
to the control plasmid pOSEL144 (Xna- 144). Samples were collected either at optical 
density at 600 nm (OD600) 0.8 or 3.2. Proteins were collected by precipitation with TCA 
from culture supernatants from corresponding bacterial cells and subjected to electrophoretic 
separation under reducing SDS-PAGE for Western blot analysis with polyclonal anti-CD4 
antibodies, T4-4. The pOSEL651 construct contained the following elements: P23 promoter- 
CbsAss-2D-CD4. 

[44] Figure 3 illustrates that the engineered L. jensenii Xna secretes 
biologically active 2D-CD4. Z. jensenii Xna- 144 or Xna-65 1 bacteria were grown in MRS 
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broth (37°C and 5% C0 2 ). Culture supernatants were used for CD4 ELISA (A) and gpl20 
binding assay (5). 2D-CD4 concentrations were determined from standard curves generated 
with refolded 2D-CD4 expressed in E. coli. 

[45] Figure 4 illustrates that L. jensenii derived 2D-CD4 inhibits HIV-1 
infectivity. The inhibitory effect of the 2D-CD4 protein produced by Xna-65 1 after partial 
purification (A) and conditioned medium 199 containing 2D-CD4 (B) on HIV-1 infectivity of 
HeLa cells expressing CD4-CXCR4 was determined by viral entry assay. The results (mean 
i SD) from triplicate determinations in a single experiment were presented and confirmed in 
a series of independent experiments. The luciferase activity from virus infected cells without 
addition of 2D-CD4 samples was defined as 100%. This corresponded to relative light units 
(RLU) values of 276 ± 21 in panel A, 350 ± 19 in panel B. 

[46] Figure 5 illustrates that co-incubation of L. jensenii Xna-65 1 bacteria 
with HeLa-CD4 cells and recombinant viruses inhibit HIV-1 infectivity. L. jensenii Xna-144 
or Xna-65 1 bacteria in medium 199 were co-incubated with Env-pseudotyped HIV-l H xB2 and 
,HeLa-CD4-CXCR4, fori hr at 37°C in the viral entry assay. The results (mean ± SD) from 
triplicate determinations in a single experiment were presented and confirmed in a separate 
experiment. The values presented represent luciferase activity relative to that derived from 
cells not treated with samples, which was defined as 100%. This corresponded to absolute 
RLU values of 324 ±19. 

[47] Figure 6 illustrates secretion of M6i 2 o-HveC V domain by modified Z,. 
jensenii 1 153. Bacterial cells transformed with pOSEL175, 215, and 197 were grown in 
Rogosa SL broth. Proteins in cell-free conditioned media were TCA precipitated for Western 
analysis and reacted with a polyclonal M6 antiserum. The expression constructs contained 
the following elements: P 59 promoter-CbsAss-M6i 2 o-HveC V in pOSEL215; P 59 promoter- 
CbsAss-M6i2o-2D-CD4 in pOSEL197. M6 m represents the N-terminal domain of S. 
pyogenes M6. 

DETAILED DESCRIPTION OF THE INVENTION 
I. Introduction 

[48] The present invention provides mucosal surface-colonizing species of 
Lactobacillus that are recombinant^ altered to express biologically active proteins and uses 
thereof. The inventors have surprisingly discovered that, in contrast to other mucosal 
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surface-colonizing lactobacilli such as L. crispatus, L. jensenii can be readily transformed to 
express biologically active proteins, including proteins of mammalian origin. 

[49] Biologically active proteins include, for example, proteins with at least 
one disulfide bond. Exemplary biologically active proteins include, anti-viral, anti-bacterial 
or anti-fungal or therapeutic polypeptides, such as anti-inflammatory proteins, growth factors 
and growth factor antagonists, sugar-binding polypeptides, antibodies and enzymes. The 
bacteria of the invention are useful, e.g., for expressing biologically active proteins within the 
vagina or gastrointestinal tract for the purpose of preventing and/or treating diseases. 

II. Recombinant Techniques 

A. Molecular Biology Methods 

[50] TWs invention relies on routine techniques in the field of recombinant 
genetics. Basic texts disclosing the general methods of use in this invention include 
Sambrook et al., Molecular Cloning, A Laboratory Manual (3rd ed. 2001); Kriegler, Gene 
Transfer and Expression: A Laboratory Manual (1990); and Current Protocols in Molecular 
Biology (Ausubel et al, eds., 1994)). 

[51] For nucleic acids, sizes are given in either kilobases (kb) or base pairs 
(bp). These are estimates derived from agarose or acrylamide gel electrophoresis, from 
sequenced nucleic acids, or from published DNA sequences. For proteins, sizes are given in 
kilodaltons (kDa) or amino acid residue numbers. Proteins sizes are estimated from gel 
electrophoresis, from sequenced proteins, from derived amino acid sequences, or from 
published protein sequences. 

[52] Oligonucleotides that are not commercially available can be 
chemically synthesized according to the solid phase phosphoramidite triester method first 
described by Beaucage & Caruthers, Tetrahedron Letts. 22:1859-1862 (1981), using an 
automated synthesizer, as described in Van Devanter et al, Nucleic Acids Res. 12:6159-6168 
(1984). Purification of oligonucleotides is by either native acrylamide gel electrophoresis or 
by anion-exchange HPLC as described in Pearson & Reanier, J. Chrom. 255:137-149 (1983). 

[53] The sequence of the cloned genes and synthetic oligonucleotides can 
be verified after cloning using, e.g., the chain termination method for sequencing double- 
stranded templates of Wallace et al, Gene 16:21-26 (1981). 
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E. Cloning Methods for the Isolation of Nucleotide Sequences 
Encoding Desired Proteins 

"[54] In general, the nucleic acids encoding the subject proteins are cloned 
from DNA libraries that are made from cDNA or genomic DNA. The particular sequences 
can be located by hybridizing with an oligonucleotide probe, the sequence of which can be 
derived from the sequences disclosed herein or are known in the art, which provide a 
reference for PCR primers and defines suitable regions for isolating gene-specific probes. 
Alternatively, where the sequence is cloned into an expression library, the expressed 
recombinant protein can be detected immunologically with antisera or purified antibodies 
made against a polypeptide of interest, including those disclosed herein. 

[55] Methods for making and screening genomic and cDNA libraries are 
well known to those of skill in the art (see, e.g., Gubler & Hoffinan, Gene 25:263-269 (1983); 
Benton & Davis, Science, 196:180-182 (1977); Adams et al (eds). Automated DNA 
Sequencing and Analysis, Academic Press, London (1994), and Sambrook, supra). Cells 
expressing a protein of interest are useful sources of RNA for production of a cDNA library. 

[56] Briefly, to make the cDNA library, one should choose a source that is 
rich in mRNA. The mRNA can then be made into cDNA, ligated into a recombinant vector, 
and transfected into a recombinant host for propagation, screening and cloning. For a 
genomic library, the DNA is extracted from a suitable tissue or cell and either mechanically 
sheared or enzymatically digested to yield fragments of preferably about 5-100 kb. The 
fragments are then separated by gradient centrifugation from undesired sizes and are 
constructed in bacteriophage lambda vectors. These vectors and phage are packaged in vitro, 
and the recombinant phages are analyzed by plaque hybridization. Colony hybridization is 
carried out as generally described in Grunstein et al, Proc. Natl. Acad. Sci. USA, 72:3961- 
3965 (1975). 

[57] An alternative method combines the use of synthetic oligonucleotide 
primers with polymerase extension on an mRNA or DNA template. This polymerase chain 
reaction (PCR) method amplifies the nucleic acids encoding the protein of interest directly 
from mRNA, cDNA, genomic libraries or cDNA libraries. Restriction endonuclease sites can 
be incorporated into the primers. Polymerase chain reaction or other in vitro amplification 
methods may also be useful, for example, to clone nucleic acids encoding specific proteins 
and express said proteins, to synthesize nucleic acids that will be used as probes for detecting 
the presence of mRNA encoding a polypeptide of the invention in physiological samples, for 
nucleic acid sequencing, or for other purposes (see, U.S. Patent Nos. 4,683,195 and 
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4,683,202). Genes amplified by a PGR reaction can be purified from agarose gels and cloned 
into an appropriate vector. 

[58] Appropriate primers and probes for identifying the genes encoding a 
polypeptide of the invention from tissues or cell samples can be derived from the sequences 
described in the art. For a general overview of PCR, see, Innis et al, PCR Protocols: A 
Guide to Methods and Applications, Academic Press, San Diego (1990). 

[59] A polynucleotide encoding a polypeptide of the invention can be 
cloned using intermediate vectors before transformation into Lactobacillus. These 
intermediate vectors are typically prokaryote vectors or shuttle vectors. 

C. Transformation Techniques 

[60] Appropriate bacterial host strains are selected for, e.g. their 
transformation ability, ability for heterologous protein expression, and/or mucosal surface. 
The bacterial host will be rendered competent for transformation using standard techniques, 
such as the rubidium chloride method or electroporation {see, e.g., Wei et aL, J. Microbiol 
Methods 21:97-109 (1995). 

[61] Transformation of L. jensenii by electroporation can be performed by 
modifying standard methods as described in, e.g., Luchansky et al. (J. Dairy Sci. 74: 3293- 
3302 (1991). Briefly, freshly inoculated L. jensenii are cultured in MRS broth (e.g., to 0.6- 
0.7 at OD 6 oo at 37°C and 5% CO2). The bacterial cells are harvested, washed and re- 
suspended in a cold (e.g., 4°C) solution of sucrose and MgC^. Competent cells are then 
mixed with DNA and placed in a chilled gap cuvette and electroporated. Afterward, cells are 
allowed to recover in pre-warmed broth (e.g., for about two hours at 37°C), prior to being 
plated on selective agar plate containing an antibiotic other selective agent. 

[62] Optionally, antibiotic pretreatment of the user can be used to pre-clear 
the mucosal surface of resident bacteria prior to introduction of the bacteria of the invention 
into the vagina or gastrointestinal tract. See, e.g., Freter et al., Infect. Immun., 39:686-703 
(1983). Antibiotics can be provided orally or can be applied directly to the vagina. 

[63] A first method involves repetitively selecting for rapid colonizing 
bacteria on animal or human mucosal layers. For example, one applies a wildtype bacterial 
strain to a mucosal surface and repetitively isolates and in vitro cultures bacteria, returning at 
each step to the mucosal surface. Ultimately, a bacterium with an enhanced colonizing 
ability is obtained. 
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[64] A second method involves expression of fusion proteins on the surface 
of recombinant bacteria. The fusion protein consists of a host-binding domain linked to a 
polypeptide of interest. The host-binding domain will allow the bacteria to bind to certain 
determinants (protein or carbohydrate) on a selected host mucosal surface with high affinity, 
thus conferring the bacteria a survival advantage over the resident microflora. 

[65] The third method involves induction of resident microflora to express a 
heterologous protein by introducing the gene via bacteriophage. A number of bacteriophage 
vectors have been developed for use in different bacteria. For example, a bacteriophage 
vector based on the temperate bacteriophage <£adh can be used (see, e.g., Raya et al, J. 
Bacteriol 174:5584-5592 (1992) and Fremaux et al, Gene 125:61-66 (1993)). This vector 
undergoes site-specific integration into the host chromosome at defined phage (attP) and 
bacterial (attB) attachment sites. Similarly, Lactobacillus-specific bacteriophage can be used 
to transduce vectors or other polynucleotides into the Lactobacillus chromosome. 
Lactobacillus-speci&c phage include mv4 (Auvray et al, J. Bacteriol, 179:1837-1845 
(1997)), <£adh (Fremaux et al, Gene 126:61-66 (1993)), <£gle (Kakikawa et al, Gene 
175:157-165 (1996), and those belonged to Bradley's groups A or B in vaginal lactobacillus 
isolates (Kilic et al, Clin. Diagn. Lab. Immunol 8:31-39 (2001)). 

[66] Certain agents that do not irritate mucosal epithelial cells may also be 
added to a unit dose of the bacteria in capsules or tablets to aid in colonization. Many 
bacteria on mucosal surfaces secrete capsular materials that coalesce to form a biofilm that 
covers the entire mucosal surface. It may be beneficial to add an enzyme that digests this 
biofilm material to promote penetration of the engineered bacteria into the biofilm for more 
successful colonization. The enzymes include DNAses, peptidases, collagenases, 
hyaluronidases, and other carbohydrate degrading enzymes. Antibiotics to which the 
engineered bacteria itself is not susceptible may also be added to decrease the number of 
resident bacteria on the mucosal surface in order to make room for the engineered bacteria. 

D. Expression 

[67] Expression cassettes of the invention can include a variety of 
components to regulate expression and localization of the polypeptides of the invention. For 
example, expression cassettes can include promoter elements, sequences encoding signal 
sequences, a coding sequence for the polypeptide of interest and anchor sequences. 

[68] Expression of the heterologous polynucleotides or polypeptides can be 
constitutive (e.g., using P59 (van der Vossen et al, Appl. Environ. Microbiol 58:3142-3149 
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(1992)) or P 23 (Elliot et al, Cell 36:21 1-219 (1984)) promoters). Alternatively, expression 
can be under the control of an inducible promoter. For example, the Bacillus amylase 
(Weickert etaL, J.Bacteriol. 171:3656-66 (1989)) or xylose (Kim etal. Gene 181:71-76 
(1996)) promoters as well as the Lactococcus nisin promoter (Eichenbaum et al., Appl 
Environ. Microbiol. 64:2763-2769 (1998)) can be used to drive inducible expression. In 
addition, acid or alkaline-induced promoters can be used. For example, promoters that are 
active under the relatively acidic conditions of the vagina (e.g., those described in U.S. Patent 
No. 6,242,194) can be used. Alternatively, promoters can be used that are induced upon 
changes in the vagina in response to semen. For example, alkaline-induced promoters are 
used to induce expression in response to the increased alkaline conditions of the vagina 
resulting from the introduction of semen. 

[69] A variety of signal and anchor sequences are known to direct 
expression of polypeptides to the membrane, extracellular space or the cell wall (e.g., by 
covalent attachment to peptidoglycan). Exemplary signal sequences include the signal 
sequence from a- Amylase of L. amylovorus (Giraud & Cuny, Gene 198:149-157 (1997)) or 
the signal sequence from the S-layer gene (cbsA) of L. crispatus (e.g., 
MKKNLRIVSAAAAALLAVAPVAA (SEQ ID NO:l) or 

MKKNLRIVSAAAAALLAVATVSA (SEQ ID NO:2)). Signal sequences are typically 
located at the amino-terminus of a polypeptide. 

[70] Anchor sequences are typically located at the carboxyl terminus of an encoded protein 
sequence. Anchor sequences include, e.g., a cell wall associated sequence; the sequence 
LPQ(S/A/T)(G/A), where residues in parentheses indicate different options at that position; 
and a hydrophobic sequence, and, optionally, a charged sequence. In some embodiments, the 
anchoring sequence comprises 

VTRTINVVDPITGKISTSVQTAKFTREDKNSNAGYTO 

VNVEQIKGWAKVDGNVD 

ADGKNKDDLPDGTKYTWKEW 

ESNRDTLSKEANTGNTNVAKAATVTSSKV^ 

LLGLGVNRKKRQK (SEQ ID NO:4), 

KKAEEVKNNSNATQKEVDDATNb^ 

KYNNASDDTKSKFDEALKKAEE 

KDAIM)AIKDA1WAKGTO^ 

ATKmKNTLNNLK^ 
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GAjSIFAiaa^TCT^DEA (SEO JD NO:5). or sequences substantially identical to SEQJD 
NO:4 or SEQIDNO:5. 

[71] Correct localization and folding of a polypeptide can be determined 
using standard methods. For example, cell wall enriched fractions of Lactobacillus can be 
obtained by suspending the bacteria in a buffered, solution (e.g., 25% sucrose, 1 mM EDTA, 
10 mM Tris-HCl, pH 8.0) followed by treatment with cell wall degrading enzymes (e.g., 
lysozyme and mutanolysin) and then separating out the resulting protoplasts by differential 
centrifugation (Piard et ah, J. Bacteriol 179:3068-3072 (1997)). Fractions can then be 
screened by Western blotting to confirm expression within the cell wall. 

[72] Folding and biological activity of an expressed polypeptide can also be 
determined using standard methods. For example, ELISA assays using antibodies specific 
for the natively folded polypeptide can be used to confirm folding and three-dimensional 
structure of the polypeptide. Biological activity assays will of course vary depending on the 
activity of the polypeptide. For example, for polypeptides that bind to viral proteins, the 
expressed polypeptide can be tested for its ability to bind a viral protein using standard 
binding assays. For anti-inflammatory molecules, the expressed polypeptide can be assayed 
for its ability to antagonize substances that promote inflammation. 

[73] When synthesizing a gene for improved expression in a host cell, it is 
- desirable to design the gene such that its frequency of codon usage approaches the frequency 
of preferred codon usage of the host cell. The percent deviation of the frequency of preferred 
codon usage for a synthetic gene from that employed by a host cell is calculated first by 
determining the percent deviation of the frequency of usage of a single codon from that of the 
host cell followed by obtaining the average deviation over all codons. 

[74] The polynucleotide sequence encoding a particular polypeptide can be 
altered to coincide with the codon usage of a particular host For example, the codon usage 
of Lactobacillus can be used to derive a polynucleotide that encodes a polypeptide of the 
invention and comprises preferred Lactobacillus codons. The frequency of preferred codon 
usage exhibited by a host cell can be calculated by averaging the frequency of preferred 
codon usage in a large number of genes expressed by the host cell. This analysis is 
preferably limited to genes that are highly expressed by the host cell. Pouwels & Leunissen 
(Nucleic Acids Res. 22:929-936 (1994)), for example, provides the frequency of codon usage 
by highly expressed genes exhibited by various Lactobacillus species. Codon-usage tables 
are also available via the internet. 
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Hff. Proteins of the invention 

[75] The polypeptides of the invention expressed in Lactobacillus sp. can 
be any polypeptide. Typically, the polypeptides of the invention are expressed under 
conditions to allow for biological activity of the polypeptides. In some embodiments, a 
disulfide bond exists in the expressed polypeptide. In some embodiments, the disulfide bond 
is required for the polypeptide's biological activity. 

[76] Polypeptides of the invention can be of any size molecular eight. For 
example, the polypeptides can be between about 100 and 100,000 daltons, between about 500 
and 40,000 daltons, between about 500 and 10,000 daltons, between about 10,000 and 50,000 
daltons, or between about 50,000 and 100,000 daltons. 

[77] Examples of classes of polypeptides that can be used according to the 
methods of the invention to prevent or treat pathogen infection include, e.g., anti-viral 
polypeptides, anti-bacterial polypeptides, anti-fungal polypeptides, and polypeptides that bind 
to viruses, bacteria or fungi, including antibodies, antibody fragments, or single-chain 
antibodies. 

[78] In some cases, the polypeptides of the invention will be a receptor that 
viral or bacterial pathogens bind to infect a host. Alternatively, the polypeptides are agents 
that, e.g., inhibit pathogen replication, viability, entry or otherwise bind to the pathogen. In 
some embodiments, the polypeptides of the invention bind or inhibit sexually transmitted 
pathogens and other pathogens transmitted to or from the vagina. For example, since viruses 
require binding to a receptor on the target cell surface for infection, strategies directed at 
inhibiting the interaction of a virus with its host receptor are effective at preventing infection. 

[79] Exemplary anti- viral polypeptides include, e.g., CD4 or virus-binding 
fragments thereof (eg., 2D-CD4) (e.g., Orloff et al 9 J. Virol 67:1461-1471 (1993)), stable 
CD4 trimers formed via a trimeric motif (e.g., Yang et ah, J. Virol 76:4634-4642 (2002), a 
dodecameric CD4-Ig fusion protein (Arthos et al, J. Biol Chem. 277:1 1456-1 1464 (2002)), 
a-defensins (e.g. Zhang et al, Science 298:995-1000 (2002), CD4 in fusion with a single 
chain variable region of the 17b monoclonal antibody (mAb) (Dey et al, J. Virol. 77:2859- 
2865 (2003), cyanovirin-N (e.g., Bolmstedt et al, Mol Pharmacol 59:949-54 (2001)), HveC 
(e.g., Cocchi et al, Proc. Natl Acad. Sci. USA. 95:15700-15705 (1998)), and intercellular 
adhesion molecule- 1 (ICAM-l). Other embodiments include, e.g., viral receptors or heparin 
or heparin-like molecules, mannose-binding lectin, including dendritic cell-specific ICAM-3 
grabbing nonintegrin (e.g., Geijtenbeek et al, Cell 700:587-597 (2000); Feinberg et al, 
Science 294:2163-2166 (2001)), anti-HSV-1 gpl20 single-chain antibody (e.g. Marasco et 
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al, Proc. Natl. Acad. Sci. USA. 90:7889-7893 (1993)), human mAb M2, recognizing the 
CD4-binding site of HIV-1 gpl20 (e.g. Saphire et al t Science 293:1155-1 159 (2001)) or 
other molecules with similar specificity, including neutralizing antibodies that bind to HS V 
(e.g., Burioni et al., Proc. Natl. Acad. ScL USA. 91:355-359 (1994)), and HIV-1 entry 
inhibitory protein (e.g., Root et al, Science 291:884-888 (2001); Sia et al, Proc. Natl Acad. 
Sci. USA. 99:14664-14669 (2002)). 

[80] Infection with human papillomaviruses (HPVs) is a factor that is 
associated with development of cervical cancer (e.g., zurHausen, Virology 184:9-13 (1991); 
Stanley, Best Prat. Res. Clin. Obstet. Gynaecol. 15:663-676 (2001)). Therefore, the presence 
of molecules that inhibit or bind to HPV is useful for preventing both HPV infection and the 
development of cervical cancer. Exemplary anti-HPVs polypeptides include, e.g. 
neutralizing antibodies that bind HPV type 16 E6 or E7 protein (e.g. Mannhart et al, Mol. 
Cell Biol 20:6483-6495 (2000)), HPV-binding proteins, or HPV proteins that can be used to 
elicit an immune response directed to the virus. 

[81] Viral transmissions, like HIV, HS V, or HPV, involves mother to baby 
transmission, with vaginal delivery being one of transmission routes (Montgomery et al, 
Lancet 360:643-644 (2002); Brown et al, JAMA 289:203-209 (2003); Puranen et al, Am. J. 
Obstet. Gynecol. 174:694-699(1996)). Therefore, expression of molecules that inhibit or 
bind to HIV, HSV or HPV in vaginal colonized lactobacilli is useful for reducing the load of 
infectious viral particles in infected women, as a means of restricting vertical transmission of 
the virus from the pregnant mothers to their newborns, during delivery. 

[82] The capacity to bind a pathogen such as a virus or bacteria may be 
conferred onto the bacteria of the invention in at least several ways. The first is by making 
the bacteria express on its surface the normal host receptor for the virus, such as ICAM-1 for 
human rhinovirus (HRV) (major group) and CD4 for HIV. These are normal human proteins 
and the complete sequences of many of these genes have been determined and are stored in 
the database GenBank. 

[83] A second method is by expressing on the bacterial surface an antibody 
fragment or other polypeptide that binds to a conserved determinant on the viral surface, such 
as VP4 on polio virus, or gpl20 on HIV. Antibody fragments (and peptides) specific for 
essentially any antigen can be selected, e.g., from a phage-display library (Marks et al, J. 
Biol. Chem. 267:16007-16010 (1992)). Antibodies can be directed to any epitope on or 
associated with a pathogen as well as other epitopes such as those discussed below. 
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[C4] A third method involves the expression of carbohydrate-binding 
polypeptides on the surface of the bacteria. Examples of these molecules include heparin- 
binding polypeptides, or mannose-binding polypeptides. 

[85] Anti-bacterial polypeptides include those that bind to or inhibit growth 
or colonization by uropathogenic E. coli. Exemplary anti-bacterial polypeptides include, e.g., 
permeability-increasing protein against gram-negative bacteria (Levy, Expert Opin. Investig. 
Drugs 11:159-167 (2002), mammalian ant-microbial peptides, j3-defensins (Ganz & Lehrer, 
Pharmacol. Titer. 66:191-205 (1995), bacteriocins (e.g., Loeffler et ah, Science 294:2170- 
2172 (2001) and antibodies that specifically bind to the bacteria. 

[86] Anti- fungal polypeptides include those that bind to or inhibit growth or 
colonization by fungi such as Candida. 

[87] Additional examples of biologically-active polypeptides useful 
according to the invention include therapeutic polypeptides or agents such as anti- 
inflammatory molecules, growth factors, molecules that bind to, or antagonize, growth 
factors, therapeutic enzymes, antibodies (including, e.g., antibody fragments or single-chain 
antibodies) and molecules that inhibit or treat cancer including cervical cancer. These 
examples are not intended to be limiting as numerous other therapeutically active 
polypeptides can readily be cited. 

[88] Anti-inflammatory molecules include, e.g., antibodies or other 
molecules that specifically bind to tumor necrosis factor (TNF) or interleukin-8 (IL-8). Other 
exemplary anti-inflammatory molecules include IL-1 0 and IL- 1 1 . 

[89] Growth factors useful in the invention include, e.g., those involved in 
local tissue repair such as keratinocyte growth factor (KGF), heparin-binding epidermal 
growth factor (HB-EGF), fibroblast growth factor (FGF) and transforming growth factor-beta 
(TGF-P), or antagonists of these molecules. 

[90] Therapeutic enzymes include, e.g., nitric oxide (NO) synthase. 

[91] Anti-cancer molecules include those that induce apoptosis, that 
regulate cell cycle such as p53, or that act as a vaccine to target cancer-specific epitopes. 

[92] Vaccine molecules useful in the invention include polypeptides that 
elicit an immune response to viruses, bacteria, or fungi. Exemplary viral vaccines elicit 
response to, e.g., HIV, HPV, HSV, or smallpox. 
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IV* Delivery 

[93] Delivery of engineered bacteria to a desired mucosal surface depends 
on the accessibility of the area and the local conditions. For example, engineered bacteria 
may be placed in a saline solution or in a foam for delivery onto the vaginal mucosa Foams 
can include, e.g., one or more hydrophobically modified polysaccharides such as cellulosics 
and chitosans. Cellulosics include, for example, hydroxyethyl cellulose, hydroxypropyl 
cellulose, methyl cellulose, hydroxypropylmethyl cellulose, hydroxyethyl methyl cellulose, 
and the like. Chitosans include, for example, the following chitosan salts; chitosan lactate, 
chitosan salicylate, chitosan pyrrolidone carboxylate, chitosan itaconate, chitosan niacinate, 
chitosan formate, chitosan acetate, chitosan gallate, chitosan glutamate, chitosan maleate, 
chitosan aspartate, chitosan glycolate and quaternary amine substituted chitosan and salts 
thereof, and the like. Foam can also include other components such as water, ethyl alcohol, 
isopropyl alcohol, glycerin, glycerol, propylene glycol, and sorbitol Spermicides are 
optionally included in the bacterial composition. Further examples of foams and foam 
delivery vehicles are described in, e.g., U.S. Patent Nos. 5,595,980 and 4,922,928. 

[94] Alternatively, the bacteria can be delivered as a suppository or pessary. 
See, e.g., U.S. Patent No. 4,322,399. In soine embodiments, the bacteria of the invention are 
prepared in a preservation matrix such as described in U.S. Patent No. 6,468,526 and are 
delivered in a dissolvable element made of dissolvable polymer material.and/or complex 
carbohydrate material selected for dissolving properties, such that it remains in substantially 
solid form before use, and dissolves due to human body temperatures and moisture during 
use to release the agent material in a desired timed release and dosage. See, e.g., U.S. Patent 
No. 5,529,782. The bacteria can also be delivered in a sponge delivery vehicle such as 
described in U.S. Patent No. 4,693,705. 

[95] In some embodiments, the bacteria are administered orally. For 
example, a daily dose of about 10 8 lactobacilli can be used to restore the normal urogentital 
flora. See, e.g., Reid etai, FEMSImmuno. Med. Microbiol. 32, 37-41 (2001). 

[96] In some embodiments, applications of engineered bacteria to a mucosal 
surface will need to be repeated on a regular basis; optimal dosing intervals are routine to 
determine, but will vary with different mucosal environments and bacterial strain. The 
dosing intervals can vary from once daily to once every 2-4 weeks. 

[97] In embodiments where bacteriophage are introduced to transform 
native Lactobacillus, the nucleic acid of the selected bacteriophage may be manipulated such 
that the heterologous gene(s) replaces the genes coding for bacteriophage coat proteins, 
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rendering the bacteriophage replication-defective. Adding these recombinant DNA 
molecules into cell lysates containing functional bacteriophage proteins will lead to assembly 
of functional bacteriophage particles carrying the heterologous gene(s). These replication- 
defective bacteriophage particles can then be introduced onto a desired mucosal surface to 
infect selected floral bacteria. The typical dosage would be 10 8 to 10 12 PFU/ml applied to the 
mucosal surface. The proportion of solution to the treated surface should approximate 0.1 to 
1.0 ml per square centimeter of mucosal surface. The vehicle would be similar to the vehicle 
described above for the bacteria. 

[98] All publications and patent applications cited in this specification are 
herein incorporated by reference as if each individual publication or patent application were 
specifically and individually indicated to be incorporated by reference. 

EXAMPLES 

[99] The following example is offered to illustrate, but not to limit, the 
claimed invention. 

Introduction 

[100] The present invention involves the genetic modification of human 
isolates of Lactobacillus jenseni to express, secrete, or surface anchor biologically active 
polypeptides. To demonstrate this invention, high affinity HIV- or HSV-binding proteins 
were expressed in Lactobacillus jensenii. These recombinant bacteria are intended to be 
instilled into the vagina, where they will colonize the mucosa and produce virus-binding 
proteins that either remain associated with the cell-surface of these bacteria or are secreted 
into the surrounding mucosal biofilm matrix. Subsequent binding of the infectious viral 
particles to these proteins traps the virus and impedes its transmigration through the 
protective mucosal barrier. Since an increased residence time within the mucosa will expose 
the viral particles to virucidal substances (like lactic acid and peroxide), a significant 
population of the virus introduced into the vagina from the ejaculate of the infected male 
partner is likely to become inactivated when retained within this hostile environment. In this 
manner, the entire biofilm may be transformed into a more effective biological barrier, which 
serves as a protective shield against viral transmission to the female host. 

[101] Little, if any, of this work has been devoted toward the use of human 
vaginal isolates of lactobacilli as expression hosts. Indeed, as described herein, it was 
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discovered by the inventors that some vaginal isolates (e.g., L crispatus) could not be 
transformed, while Ljensenii could be efficiently transformed. The genetic modification of 
colonizing vaginal isolates of Ljensenii is described herein. These bacteria have been 
modified to express a prototypical HIV-binding protein, 2D-CD4. The 2D-CD4 molecules 
produced by these genetically modified bacteria exhibit potent biological activity, including 
the ability to bind to gpl20 and to inhibit HIV infectivity in vitro. This work demonstrates 
the successful expression of a complex disulfide-bonded mammalian protein in a vagina- 
colonizing isolate of lactobacillus. Furthermore, it enables the clinical efficacy of the 
engineered bacteria to block HIV transmission in women. 

Experimental Methods 

[102] Bacterial strains and plasmids. Naturally occurring human vaginal 
isolates of Ljensenii and I. crispatus were obtained from vaginal swabs of healthy 
volunteers. Culture media for the routine propagation of L. jensenii and L. crispatus (37°C, 
5% C0 2 ) included Mann Rogosa Sharpe (MRS) broth and Rogosa SL broth , and were 
obtained from Difco (Detroit, MI). Plasmids were introduced by electroporation into i. 
jensenii by the following method. 

[103] For plasmid selection and maintenance, erythromycin (20 |Jg/ml) was 
added to either liquid or solid media. For shuttle plasmid construction and maintenance, E. 
coli DH12s cells bearing the recombinant constructs were grown in LB broth (Difco) at 37°C, 
supplemented with ampicillin (100 (ig/ml). E. co/f-derived plasmids were transformed into L. 
jensenii using modifications of the protocol of Luchansky et aL, J. Dairy Set 74:3293-3302 
(1991). Briefly, freshly inoculated L. jensenii were cultured in MRS broth to 0.6-0.7 at OD 6 oo 
at 37°C and 5% CO2. The bacterial cells were harvested, washed and re-suspended in 952 
mM sucrose and 3.5 mM MgCh at 4°C. Using a pre-chilled 0.2 cm gap cuvette, competent 
cells were added to 1- 2 \ig of DNA and electroporated immediately at 2.5 kV/cm and 200 
ohms using Gene Pulser II (Bio-Rad, Hercules, CA). Afterward, cells were allowed to 
recover in prewarmed MRS broth for two hours at 37°C, prior to being plated on selective 
MRS agar plate containing 20 |ig/ml erythromycin. 

[104] Expression vector. The primary shuttle vector used in these studies 
was pOSEL144 (Figure 1), a modified version of pLEM7 (Fons etal, Plasmid. 37:199-203 
(1997)) in which a colEl origin of replication was added from pBluescript at the Acc I site. 
The partial IS element was deleted by first cutting with Sma I, partially digesting with Nde I, 
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blunting with TO.enow fragment and then re-ligating. Finally, the plasmid was subjected to 
site-directed mutagenesis to remove two Mfe I sites within the enn gene, resulting in 
pOSEL175. 

[105] Cloning of S-Iayer Signal Sequence. S-layer signal sequence 
(CbsAss) of L, crispatus of human vaginal isolates was cloned to target for protein secretion 
in lactobacillus. The S-layer gene (cbsA) was amplified from chromosomal DNA isolated 
from a proprietary Xna strain of L. crispatus using primers designed to flank the sequence of 
the cbsA gene fromL crispatus JCM5810 (Martinez ei al, J, BacterioL 182:6857-6861 
(2000)). Two primers, 5 * -GCGAATTC A AGG AGG AAA AGA CCA CAT-3' and 5>- 
CCA GCT AGC TGA AAC AGT AGA AAC GGC-3', were designed to amplify the cbsA 
sequence from the putative ribosome binding site to the signal peptidase cleavage site, with 
EcoRl and Nhel sites added to the 5' and 3' ends, respectively. The amplified CbsA ss was 
then digested and used for cloning in the expression cassette. 

[106] Construction of expression cassettes in L. jenseniL Oligonucleotide 
primers for PCR amplification of various portions of the fusion constructs detailed in this 
study include the following: 

P 59 .f 5^GCGAGCTCCCCAAAGAAGCGCGTAATATC-3' 

P 59 .r 5 '-GGAAACACGCTAGCACTAACTTCATT-3 ' 

P 23 .f 5 '-GTGGAGCTCCCCGAAAAGCCCTGACAACCC-3 • 

P23.r 5 '-GGAAACACGCTA^ACTAACTTCATT-3 ' 

2D-CD4.f S'-GCGGCTAGCAAGAAAGTTGTTTTAGGTAAA-S' 

2D-CD4.r 5 5 -GCACAATTCTGATGCCTTTTGAAAAGCTAA-3 ' 

CbsAss.f 5 '-GCGAATTCAAGGAGGAAAAGACCAC AT-3 ' 

CbsAss.r 5 ^CCAGCTAGCTGAAACAGTAGAAACGGC^ 9 

HveC V & y.t 5 '-GGCGCTAGCCAGGTGGTCCAGGTGAACGACTCC-3 ' 

HveC V.r 5 *-CCGCAAlTGCATCACCGTGAGATTGAGCTGGCT-3 » 

HveC 7.r 5 '-GCGCAATrGGAACATCCTAGCTCTrGTCCT^' 



[107J The promoters used in the study were P 2 3 and P 59 from Lactococcus 
lactis (van der Vossen et al.Appl Environ. Microbiol. 53:2452-2457 (1987)). The first 183 
residues comprising the N-terminal two extracellular domains of human CD4 (2D-CD4) were 
recoded by assembly PCR (Stemmer et al. y Gene 164:49-53 (1995)) to conform to a preferred 
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lactobacillus codon usage. The DNA sequence encoding for cyanovirin-N (Boyd et al. 9 
Antimicrob. Agents Chemother. 41:1521-1530 (1997)) was also synthesized by assembly 
PCR. The DNA sequence encoding HveC- V (Knimmenacher al ,J. Virol 73:8127-8137 
(1999) was obtained by PCR amplification of a human cDNA library whereas the DNA 
sequence encoding a secreted form of HveOy (Lopez etal, J. Virol 75:5684-5691 (2001)) 
was obtained by PCR amplification of a human cDNA clone encoding this protein (IMAGE 
clone #143 1 77). All expression constructs were confirmed by DNA sequence analysis. 

[108] Analysis of heterologous protein expression in £. jenseniu The 
recombinant L.jensenii cells were grown (37°C and 5% C0 2 ) in Rogosa broth that was 
buffered with 100 mM HEPES (pH 7.4). The cell cultures were grown to various densities 
and then the bacteria were removed by centrifugation (3,800 x g, 15 m). The supernatants 
were transferred to a fresh container and the proteins secreted into this spent medium were 
then precipitated by the addition of ice-cold trichloroacetic acid (TCA) at a final 
concentration of 20%. The protein precipitates were washed with 70 % ethanol, air-dried 
and heat denatured in SDS-PAGE loading buffer (50 mM Tris-HCl, pH 6.8, 0.4% SDS, 6% 
sucrose, 0.01% bromophenolblue), either with or without dithiothreitol (10 mM). To 
estimate the relative amounts of cell-surface-associated proteins in L.jensenii, bacterial cells 
were extracted at room temperature for 15 minutes with SDS-PAGE loadirig buffer. The 
extracted proteins were separated from insoluble debris by centrifugation 14,000 x g for 5 
minutes, and then resolved by SDS-PAGE using a 4-12% NuPAGE system (Tnvitrogen). For 
Western analysis, proteins were electrotransferred on to PVDF membranes (Millipore) 
according to the manufacturer's directions. The membrane was probed with polyclonal 
rabbit anti-CD4 antibodies T4-4 (AIDS Research and Reference Reagent Program of the 
National Institute of Health) or with rabbit anti-M6 of S. pyogenes, or rabbit anti-cyanovirin 
antibodies and then visualized by using alkaline phosphatase (AP)-conjugated anti-rabbit IgG 
(Sigma) and chromogenic detection reagents (Promega). Alternatively, the antigen-antibody 
reaction was visualized by using enhanced chemilluminescent reagents (Amersham 
Biosciences). 

[109] Partial purification of lactobacillus-derived 2D-CD4. Culture 
supernatants were treated with protease inhibitor cocktail (Roche Applied Science), dialyzed 
against 50 mM sodium phosphate, pH 6.8 at 4°C in 10-kDa cut-off dialysis membrane and 
then passed over a column of Q Sepharose Fast Flow resin (Amersham Biosciences). The 
2D-CD4 proteins in the flow through were then bound to SP Sepharose Fast Flow 
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(Amersham Biosciences) and eluted in the buffer containing 0.8 M NaCI. The amount of 
eluted 2D-CD4 was determined by Western blot and activity analyzed by CD4 ELISA and 
gpl20 binding using.soluble CD4 (AIDS Research and Reference Reagent Program of the 
National Institutes of Health) as a standard. The isolated soluble proteins were also subjected 
to N-terminal amino acid sequencing at Molecular Structure Facility, University of 
California-Davis. 

[110] CD4 ELISA and gpl20 binding assays. The concentration of 
correctly folded 2D-CD4 protein was assessed by a CD4-capture ELISA in a 96 well 
microliter plate format. The 2D-CD4 proteins secreted by L.jensenii were captured by the 
CD4-specific monoclonal antibody Sim.4 (McCallus et al Viral Immunol 5:163-720) at 10 
ng/ml. The bound CD4 molecules were detected with a polyclonal rabbit antibody, T4-4 
(AIDS Research and Reference Reagent Program), which was visualized by incubation with 
AP-conjugated anti-rabbit IgG (Sigma) and the AMPAK ELISA amplification system (Dako 
Diagnostics, Inc.). The gpl20 binding activity of 2D-CD4 was performed essentially as 
described by Moore, AIDS Res. Hum. Retrovimses 9: 209-219 (1993) with minor 
modification. Briefly, 96 well microtiter plates coated with a polyclonal sheep anti-gpl20 
antibody, D7324 (Aalto Bio Reagents, Dublin, Ireland), were blocked (TBS, 2% non-fat dry 
milk) and incubated with 10 ng/well recombinant gpl20 (rgpl20, Protein Sciences Corp.). 
Samples containing CD4 molecules were then captured by rgpl20, and bound CD4 
molecules were detected using the polyclonal rabbit antibody, T4-4, as described above. The 
biological activity of lactobacillus-derived cyanovirin was also determined by gpl20 binding 
assay. Samples containing cyanovirin were first capatured by rgpl20, and bound cyanovirin 
molecules were then detected using polyclonal rabbit anti-cyanovirin antibody. 

[Ill] HIV-1 infectivity assay. The effect of 2D-CD4 proteins secreted from 
L.jensenii on HIV-1 entry was determined by using a single-cycle infection assay (Connor et 
al, J. Exp. Med. 185:621-628 (1997); Chen et al, I Virol. 68:654-60. (1994)). Reporter 
viruses expressing luciferase (fflV-l Hx B2 strain Env) were produced by co-transfecting 
HEK293T cells with plasmids encoding the envelope-deficient HIV-1 NL4-3 virus, the pNL- 
Luc plasmid carrying the luciferase reporter gene, and the pSV plasmid expressing the 
glycoproteins from HIV-1hxB2 strain. The supernatant was collected 48 hours after 
transfection and filtered. Virus stocks were analyzed for HTV-1 p24 antigen concentration by 
ELISA. Subsequently, the virus p24 antigen concentration in virus-containing supernatant 
was determined by p24 antigen ELISA. 
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[112] One day before infection, HeLa-CD4 cells expressing CXCR4 were 
seeded at 5 x 10 4 cells per well in a 48-well tissue culture plate. Env-pseudotyped, luciferase- 
expressing reporter HIV- 1 Hx b2 viruses were first incubated with different concentrations of 
2D-CD4 protein (37°C, 30 minutes), either directly from culture supernatants or prepared 
after partial purification. Subsequently, the virus in the reaction mixtures was used to infect 
cells (37°C, 1 hr). Unbound virus was removed by washing with PBS. Alternatively, L. 
jensenii bacteria, resuspended at 2.5 or 5 x 10 8 cells/ml, were cultured in Medium 199 (37°C, 
30 minutes). Then, the bacteria, along with input viruses, were added to HeLa cells 
expressing CD4-CXCR4 (37°C, 1 hour). Subsequently, cells were cultured in Dulbecco's 
modified Eagle's medium supplemented with 10% FBS at (37°C, 48 hours, 5% C0 2 ). The 
cells were washed once with PBS and lysed with reporter lysis buffer (Promega Corp. 
Madison, WI). Luciferase activity in a mixture of luciferase substrate and cell lysate was 
measured in relative light units (RLU) with a Chiron microplate luminometer. 

Results : 

[113] The human vaginal ecosystem harbors a complex microflora. 
Lactobacilli are dominant within the microflora of healthy American and European women, 
and have been shown to adhere tightly to vaginal epithelial cells in vitro. Most healthy 
women are predominantly colonized by either L. crispatus or L. jensenii relative to other 
lactobacillus species. Thus, L. crispatus and L. jensenii were the two species of lactobacillus 
selected for analyzing in vitro growth properties, colonization, endogenous plasmid profiles, 
antibiotic susceptibility, relative transformation efficiency, and level of functional protein 
expression. 

[114] Growth properties: L. jensenii strains were isolated from vaginal 
swabs of healthy individuals. These isolates are useful in the identification of various 
phenotypic traits (e.g. lactic acid production, autoaggregation and adherence to vaginal 
epithelial cells) that will be necessary to maintain in the final engineered product. For 
comparison, reference strains of various vaginal lactobacillus isolates, including I. crispatus, 
L Jensenii, L. gasseri, L.fornicalis andZ. vaginalis, were obtained from the American Type 
Culture Collection. 

[115] Antibiotic susceptibility / plasmid profiles: Environmental (WT) 
isolates of bacteria frequently contain transmissible genetic elements (plasmids, transposons, 
and temperate bacteriophage) that confer a genetic advantage for viability in vivo. The 
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presence of naturally occurring plasmids or integrated transposons, however, could adversely 
affect the stability of plasmid based expression constructs used in our preliminary studies, 
either interfering with replication or with the ability to use antibiotic selection. Alternatively, 
any naturally occurring plasmid isolated from WT L. jensenii could potentially be engineered 
for heterologous protein production without the necessity of antibiotic selection. 
Susceptibility testing was used to determine the range of sensitivity of WT isolates of L. 
jensenii and L crispatus to various antibiotic compounds with defined modes of action. Any 
strains of X. jensenii that were found to be resistant to specific compounds were examined 
more thoroughly for plasmids and transposons that commonly contain resistance genes. Both 
disc diffusion and broth microdilution assays (in MRS) were used to assess antibiotic 
susceptibility. All strains tested were sensitive to erythromycin (< 0.5 [ig/ml) and moderately 
sensitive to tetracycline (4 (ig/ml). Most of the strains were sensitive to oxacillin (< 0.5 
jug/ml) with the exception of two strains that appeared highly resistant (> 128 jag/ml). All 
strains were found resistant (> 128 (ag/ml) to aminoglycosides (kanamycin, gentamycin), 
which is expected for aerotolerant bacteria that do not synthesize respiratory quinones. 

[116] Optimization of electroporation in lactobaciUus: To support 
cloning and heterologous protein expression in these vaginal lactobaciUus strains, 
electroporation methods were developed for application to L. jensenii and L. crispatus* 
Various parameters, including culture media, cell growth stages, DNA concentration, wash or 
electroporation buffer composition, cuvette gap size, and voltage were evaluated to determine 
conditions that improved transformation frequencies for the WT lactobaciUus strains in our 
collection. E. co/f-derived plasmids were transformed into Z. jensenii by electroporation 
according to Luchansky et al (7. Dairy Set 74:3293-302 (1991)) with modifications. 
Briefly, freshly inoculated L. jensenii were cultured in MRS broth to 0.6-0.7 at OD 6 oo at 37°C 
and 5% CO2. The bacterial cells were harvested, washed and re-suspended in 952 mM 
sucrose and 3.5 mM MgCk at 4°C. Using a pre-chilled 0.2 cm gap cuvette, competent cells 
were added with 1- 2 \ig of plasmid DNA {e.g. pOSEL175) and electroporated immediately 
at 2.5 kV/cm and 200 ohms using Gene Pulser II (Bio-Rad, Hercules, CA). Afterward, cells 
were allowed to recover in prewarmed MRS broth for two hours at 37°C, prior to being 
plated on selective MRS agar plate containing 20 |ig/ml erythromycin. 

[117] In contrast to transformations of L. jensenii, clinical isolates of X 
crispatus were not successfully transformed in spite of diligent efforts to do so. The 
following table lists the protocols and success rates achieved for L Jensenii and X crispatus. 
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Protocols fused for attempts to transform L. crispaius included parameters described in: 
Luchanskyetfa/. {J.DaityScL 74:3293-302 (1991); Maassen, J. Immunol Methods 223:131- 
136 (1999) (including two washes of cells in distilled H 2 0, one wash in 30% (w/v) PEG-1000 
followed by electroporation at (8500V/cm 2 ) in 30% PEG-1000, 100 Q, 25mF capacitance); 
andBhowmik & Steele, J. Gen, Microbiol 139:1433-1439 (1993) (involving bacteria grown 
in MRS broth plus 1% glycine, wash four times in buffer (0.5 M sucrose, 0.1 mM HEPES, 
pH 7.0) and electroporation in same buffer, 0.4 cm cuvette, 2.5 kV, 25 200 f2). To date, 
human vaginal isolates of X. crispatus could not be tranformed under any conditions tested. 



Bacteria 


Protocols 


Transformants (CFU/jag 
plasmid DNA) 


L.jensenii 1153 


Electroporation 


5xl0 3 


L.jensenii Xna 


Electroporation 


1-L5xl0 3 


All available L. crispatus 
clinical strains including: 
Xna, 1160, 21 16, 2059, 
6061, and 6090 


Various protocols and 
combinations thereof 
including: 

Different buffers, varied 
electroporation parameters, 
different growth conditions, 
different plasmid replication 
origins, different plasmid 
antibiotic resistance 
markers, and prolonged 
duration of cell recovery 
after electroporation. 
Protocols included heat 
shock of bacteria to 
inactivate host restriction 
systems as well as removal 
of S-layer with chaotropic 
salts prior to competent cell 
preparation. 


0 in all cases 



[118] Development of expression vectors in lactobacillus: Plasmid vectors 
for expression of proteins in lactobacillus were engineered, based on a cloning vector 
(pLEM7) modified from an endogenous plasmid of I. reuteri LEM3 (Fons et al, Plasmid 
37: 199-203 (2002)). One of these such vectors, pOSEL144, contains a multiple cloning site, 
an origin for replication in E. coli (colEl), an additional origin for replication in lactobacillus 
(repA) and an erythromycin resistance marker (erm) for positive selection of the plasmid in 
both E. coli and lactobacillus (Figure 1). 
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[119] A modular rJoning.approach to facilitate the cloning and expression of 
heterologous polypeptides such as HIV-binding ligands. Using different combinations of 
expression strategies, these molecules are designed either to be secreted into the extracellular 
space (i.e. the surrounding media), or to be retained as a surface expressed protein linked by a 
natural covalent modification to the bacterial cell wall (peptidoglycan). The cassettes can 
contain four components, including promoter elements, signal peptide, virus-binding ligand, 
and anchor sequences (Figure 1). For the expression of various ligands, unique restriction 
sites were placed between each module for easy one-step cloning. 

[120] To obtain proof of whether this modular approach would work, several 
signal sequences and anchoring sequences were analyzed to determine the expression of 
biologically active CD4 protein fusions expressed under the control of either the P 2 3 or the 
P 59 promoter. In addition to utilizing the M6 protein signal sequence from S. pyogenes, the 
signal sequences of the a- Amylase (Amy) gene (amyA) from L. amylovorus and the S-layer 
gene (cbsA) from L. crispatus were cloned. Both sequences were obtained by PCR from 
chromosomal DNA isolated from either L. amylovorus or L. crispatus, and were designed to 
contain a ribosomal binding site and span the N-terminal sequence of either gene up to the 
signal peptidase cleavage site. The Amy signal sequence (Amy ss ) was: 
MKKKKSFWLVSFLVIVASVFFISFGFSNHSKQVAQA (SEQ ID NO:3). The CbsA signal 
sequence (CbsA ss ) was: MKKNLRIVSAAAAALLAVATVSA (SEQ ID NO:2). Unique 
restriction endonuclease sites were added to the ends of the primers so that any potential 
ligand protein could be cloned in frame with either signal sequence. The amplified cassettes, 
containing either CbsA^ or Amy ss , were then digested and cloned into the modular system. 

[121] Secretion of heterologous proteins (e.g., 2D-CD4) by modified L. 
jensenii relies on the Sec machinery that mediates translocation of a precursor protein across 
the membrane. The 2D-CD4 protein is first synthesized as signal sequence (e.g., CbsAss) 
bearing precursor. Upon translocation across the membrane, the CbsAss is cleaved by signal 
peptidase, thus releasing the mature 2D-CD4 into the extracellular milieu. The N-terminal 
amino acid sequences of secreted proteins are consistent with the predicted sequences of 
mature protein. 

[122] To express CD4 on the surface of L. jensenii, two cell-wall anchors 
were used from either the M6 protein (emm6) of £ pyogenes or the PrtP protease of I. 
paracasei. Unlike the M6 protein, which has a signature LPXTG cell wall sorting signal, the 
sortase motif for PrtP is LPKTA. The PrtP anchor was prepared by PCR amplification from 
L. paracasei ATCC 25302, then cloned and confirmed by DNA sequence analysis. 
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[123] The first two extracellular domains (K J -S 183 ) of human CD4 (2D-CD4) 
were chosenfor surface expression. 2D-CD4 is capable of high-affinity binding to gpl20 as 
is the 4-domain sCD4. Because differences in codon bias affect protein expression in L. 
jensenii, the cDNA clone of human 2D-CD4 was re-coded to conform more closely to the 
optimal lactobacillus codon usage. The 2D-CD4 sequence containing Nlie I site at its 5' end 
mdMfel site at its 3' end was amplified from recoded CD4 sequence. The resulting PCR 
product was cut by NJte I and Mfe I, and ligated with Nfie I IMfe I-treated vector for surface 
expression of 2D-CD4. The nucleotide sequences of the constructs were verified prior to 
transformation into L. jensenii. 

[124} Surface anchoring of 2D-CD4 in L. jensenii: To determine the 
extent of surface expression of 2D-CD4, a set of constructs harboring different surface 
anchors were analyzed along with pOSEL144, which was used as a negative control. 
Proteins from modified bacteria were fractionated into cell-, cell wall-, and protoplast- 
associated pools. 2D-CD4 was detected under P 59 promoter control in the cell- wall 
associated and protoplast fractions, whereas it was undetectable in cell wall fractions in 
constructs harboring the P23 promoter with either M6 or CWA100 as cell wall anchors, where 
CWA100 represents a cell wall anchoring domain of 100 amino acids in length of S. 
pyogenes. This lack of detectable cell wall associated 2D-CD4 was not due to protein 
instability during cell wall digestion. The inefficient surface anchoring of 2D-CD4 driven by 
cell wall sorting signal either from M6 of & pyogenes or PrtP protease of L. paracasei 
necessitated identification and use of native cell wall sorting signals mL. jensenii. 

[125] Secretion of biologically active 2D-CD4: An analysis of surface- 
anchored constructs demonstrated that a majority of 2D-CD4 was released into the culture 
medium. Furthermore, in vitro binding studies showed that some soluble two- or four- 
domain CD4 molecules did not diffuse freely in biological fluids, but rather became cell- 
surface associated. The growth and adherence properties of CD4-expressing lactobacilli 
appear similar to those of wild-type strains, suggesting that the expression of these virus- 
binding proteins will not interfere with the ability of the engineered lactobacillus to colonize 
the human host. Thus an engineered lactobacillus expressing these CD4 molecules could 
maintain a high local concentration of the molecule without the explicit need for covalent 
attachment to the peptidoglycan. 

[126] In the cassette-based expression system, the anchor domain determines 
whether the expressed virus binding proteins are surface-anchored or secreted. To generate 
secreted forms of the HIV-binding proteins, a stop codon was introduced in the N-terminal 
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region of the cell wail anchoring domain. A series of plasmids designed for protein secretion 
with theibliowing components were constructed: P23 promoter-signal sequence of S. 
pyogenes M6 (M6ss)-M6 l20 -2D-CD4 (pOSELl28); P59 promoter-M6ss-M6i 2 o-2D-CD4 
(pOSELi97); and P23 promoter-CbsAss-2D-CD4 (pOSEL651), where M6 120 represents the 
N-terminal 120 amino acid residues of & pyogenes M6. Both modest and high levels of 
recombinant 2D-CD4 production were achieved in L, jensenii 1153 and Xna transformed 
with pOSEL128, 197, or 65 1 when grown to log phase. The expressed protein continued to 
accumulate, even as the pH of the culture medium became acidified after overnight 
incubation. Figure 2 showed a time-dependent release of 2D-CD4 by L jensenii Xna 
harboring pOSEL65 1 relative to the bacteria harboring control plasmid pOSEL144. To 
characterize the biological activity of the secreted 2D-CD4 protein, CD4 ELISA and gpl20 
binding assays were performed. The CD4 ELISA was developed to determine the presence 
of correctly folded CD4 molecules captured onto a solid phase by adsorbed antibodies Leu3a 
or Sim.4. CD4 hybridoma (Sim.4) and CD4 monoclonal (Sim.4) are available at NIH AIDS 
Research and Reference Reagent Program. The Leu3a mAb is available at Becton 
Dickinson. Both antibodies recognized domain 1 of CD4, overlapping the gpl20 binding 
site. Polyclonal antibodies against CD4 (T4-4) were used to detect bound CD4 molecules. 
Approximately 100 ng/ml of either M6i 20 -2D-CD4 or 2D-CD4 was detectable in the spent 
supernatant (Figure 3). The gpl20-binding activities of secreted M6i 20 -2D-CD4 or 2D-CD4 
proteins correlated well with the results from CD4 ELISA (Figure 3). Thus, 2D-CD4 is 
expressed in a native conformation based on its immuno-reactivity with the conformational 
dependent antibody Sim.4 and ability to bind the counter-ligand gpl20. 

[127] L. jensenii derived 2D-CD4 inhibited HIV-1 infectivity in vitro. 
Several different approaches were used to determine whether 2D-CD4 secreted from strain 
Xna-651 interfered with the ability of an HIV- 1hxb2 virus, carrying a luciferase reporter, to 
infect HeLa cells expressing CD4-CXCR4 in a single cycle viral entry format. 

[128] First, soluble 2D-CD4 secreted into the culture supernatant, by L, 
jensenii (pOSEL651), was partially purified using ion-exchange chromatography. As a 
positive control, biologically active 2D-CD4 proteins were added to samples prepared from 
the control strain (pOSEL144). Similar to the positive control, samples from L. jensenii 
(pOSEL651) inhibited HIV-1 infectivity in a dose-dependent manner, whereas samples 
prepared from the control strain (pOSEL144) had no apparent effect (Figure 4). 

[129] Second, conditioned media was prepared by incubating L. jensenii at 
log phase in a HeLa cells compatible medium, Medium 199. Based on both CD4 ELISA and 



33 



WO 2004/007695 



PCTAJS2003/007233 



HIV 1 gpl20 binding analysis, biologically active 2D-CD4 at 50 xig/5xl0 8 colony-forming 
unit (CFU)/nil was routinely detected in the conditioned supernatant of LJensenii 
(pOSEL65 1). Addition of this conditioned media to HeLa-CD4 cells inhibited HIV-1 
infectivity in a dose-dependent manner. Once again, inhibitory concentrations of L. jensenii- 
derived 2D-CD4 were similar to those of an added 2D-CD4 control standard, while the 
conditioned media from LJensenii (pOSEL144) was inactive (Figure 4). 

[130] Co-incubation of L.jenseiiii Xna-651 bacteria with recombinant 
virus inhibited HIV-1 viral entry of target cells in vitro. In a separate set of studies, 
experiments were performed to evaluate whether co-incubation of L. jensenii Xna-651 with 
HeLa cells expressing CD4-CXCR4 and recombinant virus could decrease HTV-1 infectivity 
(Figure 5). L. jensenii Xna-651 were pre-incubated in medium 199 and then added along 
with recombinant virus to HeLa cells expressing CD4-CXCR4 cells. A determination of 
virus-associated luciferase activity at 48 hours post infection revealed that the LJensenii 
Xna-651 strain had a significant inhibitory effect on HIV-1 entry, whereas the control strain 
Xna-144 did not. L. jensenii Xna-65 1 inhibited HIV-1 entry by 79% (P < 0.01) at cell 
density of 2.5 x 10 8 /ml or by 90% (P < 0.01) at cell density of 5 xl0 8 /ml, relative to L 
jensenii Xna-144. Similar results were observed when bacteria and viruses were added to 
cultured cells without pre-incubation of the bacteria in medium 199. Taken together, these 
results demonstrate clearly that a vaginally colonizing isolate of L. jensenii, engineered to 
express 2D-CD4, markedly inhibits the infection of cultured cells by a recombinant HTV-1 
virus, thereby providing an in vitro demonstration of this anti-infective strategy. 

[131] Expression of cyanovirin-N in LJensenii. To further demonstrate 
the versatility of the expression system, the 2D CD4 coding sequence in plasmid pOSEL651 
was removed and replaced with the structural gene encoding cyanovirin-N, resulting in 
pOSEL647. To determine if the L. jensenii 1 153 or Xna harboring pOSEL647 secrete any 
cyanovirin, proteins in conditioned media at different stages of growth were TCA 
precipitated, washed with ethanol, and resolved using a 4-12% reducing SDS-PAGE. 
Western blot analysis using polyclonal anti-cyanovirin antibodies detected modest level of 
lactobacillus-derived cyanovirin at the predicted sizes. Furthermore, the secreted cyanovirin 
bound HIV-1 gpl20, suggesting that the expressed proteins are to a large extent properly 
folded into their native conformation.. 

[132] Expression of HveC in L. jensenii. The human herpesvirus entry 
mediator C (HveC) is a member of the immunoglobulin family used as a cellular receptor by 
HSV (e.g. Geraghty et al t Science 280:1618-1620 (1998)) and its gD binding domain located 
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within the Y-domain of HveC {e,g. Krummenacher et aL, 1 Virol 73:3127-3137 (1999)). 
Point mutations within the V domain of HveC have been shown to eliminate glycoprotein D 
binding and, accordingly, impair entry activity for both HSV-1 and HSV-2 (Martinez & 
Spear, J, Virol 76:7255-7262). In addition, a natural soluble isoform, designated HveCy, 
also modulates susceptibility of host cells to HSV infection (<?. g. Lopez et aL, 1 Virol 
75:5684-5691 (2001)). To secrete HveC in fusion with M6120 in L, jensenii 1 153, HveC 
expression plasrnids were constructed under control of P59 promoter and CbsAss. These 
plasmids included pOSEL215, designed for secretion of M6i2o-HveC V-domain, pOSEL220 
for secretion of M6i2o-HveCy and a control plasmid, pOSEL175. All of these plasmids were 
introduced into Z. jensenii 1 153 by electroporation. To determine if these bacterial strains 
secrete any HveC, proteins in conditioned media at different stages of growth were TCA 
precipitated, washed with ethanol, and resolved using a 4-12% reducing SDS-PAGE. 

[133] Western blot analysis using polyclonal anti-M6 antibodies and anti- 
HveC detected modest level of both M6t 2 o-HveC V domain (Figure 6) and M6 U o-HveCy at 
the predicted sizes. In addition, the ability of the M6i 2 o-HveC V domain to react with mAb 
Rl .302 was analyzed in western blot format, following Tris-Glycine PAGE (e.g. 
Krummenacher era/., J. Virol 73:8127-8137 (1999)). M6 U o-HveC bound to mAb R1.302 
(which recognizes a conformational^ dependent epitope within the V-domain), 
demonstrating the secreted M6120-HveC V protein adopts a native conformation. 

The above example is provided to illustrate the invention but not to limit its scope. 
Other variants of the invention will be readily apparent to one of ordinary skill in the art and 
are encompassed by the appended claims. All publications, databases, Genbank sequences, 
patents, and patent applications cited herein are hereby incorporated by reference. 
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WHAT TS CLAIMED IS : 

1 LA mucosal-surface .colonizing Lactobacillus jensenii bacteria 

2 recombinantly altered to express a biologically active protein. 

1 2. The bacteria of claim 1 , wherein the mucosal surface is a vagina. 

1 3. The bacteria of claim 1 , wherein the mucosal surface is a 

2 gastrointestinal tract. 

1 4. The bacteria of claim 1 , wherein the protein requires a disulfide bond 

2 to be biologically active. 

1 5 . The bacteria of claim 1 , wherein the biologically active protein is 

2 secreted. 

1 6. The bacteria of claim 1 , wherein the biologically active protein is on 

2 the bacterial surface. 

1 7. The bacteria of claim I , wherein the protein comprises a signal 

2 sequence selected from the group consisting of SEQ ID NO: 1 , SEQ ID NO:2 and SEQ ID 

3 NO:3. 

1 8. The bacteria of claim 1 , wherein the protein comprises a cell wall 

2 anchoring signal sequence. 

1 9. The bacteria of claim 8, wherein the cell wall anchoring signal 

2 sequence comprises SEQ ID NO:4 or SEQ ID NO:5. 

1 10. The bacteria of claim 1 , wherein the biologically active protein binds 

2 to a pathogen when the biologically active protein is contacted with a pathogen. 

1 11. The bacteria of claim 10, wherein the pathogen is a bacterial pathogen. 

1 12. The bacteria of claim 10, wherein the pathogen is a fungal pathogen. 

1 13. The bacteria of claim 1 0, wherein the pathogen is a viral pathogen. 

1 14. The bacteria of claim 13, wherein the viral pathogen is HIV. 
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1 15. The bacteria of claim 13, wherein the viral pathogen is~H3V. 

1 16. The bacteria of claim 10, wherein the biologically active protein is 

2 CD4 or an HIV-binding fragment of CD4. 

1 1 7. The bacteria of claim 1 6, wherein the biologically active protein is 2D- 

2 CD4. 

1 18. The bacteria of claim 1 0, wherein the biologically active protein is 

2 cyanovirin-N or a virus-binding fragment of cyanovirin-N. 

1 19 The bacteria of claim 1 0, wherein the biologically active protein is 

2 HveC or a virus-binding fragment of HveC. 

1 20. The bacteria of claim 1, wherein the biologically active protein is a 

2 therapeutic polypeptide. 

1 21. The bacteria of claim 1 , wherein the biologically active protein is a 

2 polypeptide vaccine. 

1 22. The bacteria of claim 1, wherein the biologically active protein has a 

2 molecular weight of between 500 and 40,000 daltons. 

1 23 . A method of providing a biologically active protein to a mammalian 

2 mucosal surface, the method comprising, 

3 contacting a mucosal surface with a Lactobacillus jensenii bacteria 

4 recombinants altered to express a biologically active protein in an amount able to be 

5 detected in a sample collected from the mucosal surface. 

1 24. The method of claim 23, where in the mucosal surface resides within 

2 the vagina. 

1 25. The method of claim 23, where in the mucosal surface resides within 

2 the gastrointestinal tract. 

1 26. The method of claim 23, wherein the contacting step comprises orally 

2 administering the Lactobacillus jensenii bacteria. 
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1 27. The method of claim 23 , wherein the contacting step comprises 

2 vaginally administering the Lactobacillus jensenii bacteria. 

1 28. The method of claim 23, wherein the contacting step comprises 

2 rectally administering the Lactobacillus jensenii bacteria. 

1 29. The method of claim 23, wherein the protein requires a disulfide bond 

2 to be biologically active. 

1 30. The method of claim 23, wherein the biologically active protein is 

2 secreted. 

1 31. The method of claim 23, wherein the biologically active protein is on 

2 the bacterial surface. 

1 32. The method of claim 23 , wherein the biologically active protein binds 

2 to a pathogen when the biologically active protein is contacted with a pathogen. 

1 33 . The method of claim 32, wherein the pathogen is a bacterial pathogen. 

1 34. The method of claim 32, wherein the pathogen is a fungal pathogen. 

1 35. The method of claim 32, wherein the pathogen is a viral pathogen. 

1 36. The method of claim 35, wherein the viral pathogen is HIV. 

1 37. The method of claim 35, wherein the viral pathogen is HSV. 

1 38. The method of claim 36, wherein the biologically active protein is CD4 

2 or an HIV-binding fragment of CD4. 

1 39. The method of claim 38, wherein the biologically active protein is 2D- 

2 CD4. 

1 40. The method of claim 32, wherein the biologically active protein is 

2 cyanovirin-N or a virus-binding fragment of cyanovirin-N. 

1 41 . The method of claim 32, wherein the biologically active protein is 

2 HveC or a virus-binding fragment of HveC. 
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1 42. The method of claim 32, wherein the biologically active protein is a 

2 therapeutic polypeptide. 

1 43. The method of claim 23, wherein the biologically active protein has a 

2 molecular weight of between 500 and 40,000 daltons. 

1 44. An expression cassette comprising a polynucleotide encoding a 

2 polypeptide, wherein the amino-terminus of the polypeptide comprises a heterologous 

3 bacterial signal sequence at least 95% identical to SEQ ID NO:2. 

1 45. The expression cassette of claim 44, wherein the bacterial signal 

2 sequence comprises SEQ ID NO:2. 

1 46. A vector comprising the expression cassette of claim 44. 

1 ( 47. A cell comprising the vector of claim 46. 

1 48 . The cell of claim 47, wherein the cell is a gram-positive bacterium. 
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A. Shuttle vector 



Figure 1 



BsaHl 
Bbsl, \ 
Ppu10l.» l 
NsflM \ 
NdelA\ 



Bell. 




Bst1107 
Pvul' 



Eam1105 / 
Asull 



pOSEL144 

^ 3ooo 4227 bps 




...Hindi 



Hindlll 



BamHI 
Xbal 
Noll 
Xmalll 
-BstXI 
Msll 
Sacil 
Banll 
EC1136 
Sad 
Gsul 



Drdl 



B. Expression cassette for protein secretion 
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C. Expression cassette for protein anchoring 
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Figure 4 
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Figure 6 
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